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Abstract 

We present a new model for seed banks, where direct ancestors of individuals may have 
fvq lived in the near as well as the very far past. The classical Wright-Fisher model, as well as 

7-H a seed bank model with bounded age distribution considered by Kaj, Krone and Lascoux 

(2001) are special cases of our model. We discern three parameter regimes of the seed bank 
age distribution, which lead to substantially different behaviour in terms of genetic vari- 
ability, in particular with respect to fixation of types and time to the most recent common 
ancestor. We prove that for age distributions with finite mean, the ancestral process con- 
verges to a time-changed Kingman coalescent, while in the case of infinite mean, ancestral 
lineages might not merge at all with positive probability. Further, we present a construction 
of the forward in time process in equilibrium. The mathematical methods are based on 
i—^ renewal theory, the urn process introduced by Kaj et al., as well as on a paper by Hammond 

P^ and Sheffield (2011). 
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csj 1 Introduction 

> 

In this paper we discuss a new mathematical model for the description of the genetic variability 

of neutral haploid populations of fixed size under the influence of a general seed bank effect. In 

l/~j contrast to previous models, such as the Kaj, Krone and Lascoux model [6], we are particularly 

£T} interested in situations where ancestors of individuals of the present generation may have lived 

in the rather remote past. 

Seed banks are of significant evolutionary importance, and come in various guises. Typical 
situations range from plant seeds which fall dormant for several generations during unfavourable 
. ,_i ecological circumstances |111 112], fruit tissue preserved in Siberian permafrost |13j , to bacteria 

turning into endospores if the concentration of nutrients in the environment falls below a certain 
threshold. Such endospores may in principle persist for an unlimited amount of time before they 
become active again (see, e.g. [2]). Seed bank related effects can be viewed as sources of genetic 
novelty [7] and are generally believed to increase observed genetic variability. 
In [6], a mathematical model for a (weak) seedbank effect is investigated, with the number of 
generations backwards in time that may influence the current population being bounded by a 
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constant m and being small when compared to the total population size (resp. during passage 
to a scaling limit). Under such circumstances, it is then shown that the ancestral process of the 
population can be approximately described by a time-changed Kingman coalescent, where the 
(constant) time change leads to a linear decrease of the coalescence rates of ancestral lineages 
depending on the square of the expected seedbank age distribution. Overall, genetic variability 
is thus increased (in particular if mutation is taken into account), but the qualitative features 
of the ancestral history of the population remain unchanged. 

In the present paper, we consider the ancestral process of a neutral seed bank model with 
Wright-Fisher-type dynamics, assuming constant population size N. However, the distance 
measured in generations between direct ancestor and potential offspring will not assumed to 
be bounded, but rather sampled according to some (potentially unbounded) age distribution fx 
on N. For /j, = Si, we recover the ancestral process of the classical Wright-Fisher model, and 
scaling by the population size yields a Kingman coalescent as limiting ancestral process. For 11 
with bounded support, say with a maximum value m, independent of N, we are in the setup 
of |6j, and obtain a time change of Kingman's coalescent appearing in the limit (again after 
classical scaling). 

Yet, some species suggest (i.e. bacteria transforming into endospores) that \i could be effec- 
tively unbounded, in particular non-negligible when compared to the population size. This can 
lead to entirely different regimes. 

Our first result is that if li has finite expectation, we again obtain a time-changed Kingman's 
coalescent after classical rescaling. The behaviour of the model however changes completely if 
we assume 11 to have infinite expectation. A natural example for age-distributions is a discrete 
measure \i with a power-law decay, that is 

/j({n,n+l,...})=n- a L(n) 

for some a > and some slowly varying function L. Depending on the choice of a, we investigate 
the time to the most recent common ancestor (MRCA) of two individuals, if it exists. It turns 



out (Theorem 2.2) that for a > 1/2, there is always a common ancestor, but the expected time 
to the MRCA is finite if a > 1 and infinite if a < 1. If a < 1/2, any two ancestral lineages never 
meet at all with positive probability. 

In the following section, we construct our model and present the main results. The proofs 
are given in Section 3. 

2 Model and main results 

We work in discrete time (measured in units of non-overlapping generations) and with fixed finite 
population size N E N. Time in generations is indexed by Z. The dynamics of the population 
forwards in time is given in the following way: Each individual chooses the generation of its 
father according to a law li on N, meaning that \x{n) gives the probability that the immediate 
ancestor of an individual of generation i has lived in generation i — n. We call li the seedbank 
age distribution. To avoid technicalities, we will always assume //({l}) > 0. After having chosen 
the generation, the individual picks the father unifomly among the N possible ancestors from 
that generation. 

For concreteness, we will often assume that the age distribution li is of the form fi = Li a , 
with 

fi a ({n, n+ 1,...}) =n~ a L(n), n £ N, 

for some a S (0, oo) and some slowly varying function L. Let T a := {fx a }, a £ (0, oo) denote the 
set of all measures li of this form. We are interested in the question of whether or not in such 



a population a genetic type eventually fixates, and if this happens in finite time almost surely. 
In the backward picture, this is related to asking if a finite set of individuals has a most recent 
common ancestor and when it lived. 

It turns out that in the above construction an ancestral line can be described by a renewal 
process with interarrival law [i. The question of existence of a common ancestor and time to the 
most recent common ancestor can therefore be investigated via classical results of Lindvall [8J 
on coupling times of discrete renewal processes, which are controlled in the power law case via 
applications of Karamata's Tauberian Theorem for power series, see e.g. pp. This leads to three 
different regimes, see Theorem |2.2[ If on the other hand one is interested in the forward in time 
process, mathematical modelling problems arise: In order to obtain a new generation of such 
a population, one requires information about the whole history, i.e. needs to start sampling 



at '— oo'. In subsection 2.2 we present a construction of such a population in equilibrium, 
which allows to study the correlations of the allele frequency process. This construction can 
be formalized in terms of Gibbs measures, following a paper of Hammond and Sheffield |3], 
where the case N = 1 is considered in order to construct a discrete process with long-range 
correlations that converges to fractional Brownian motion. This is sketched in the appendix. 

2.1 Renewal construction of ancestral lineages and time to the most recent 
common ancestor 

We start with a descripton of the ancestral lineages of samples in our model in terms of renewal 
theory. Fix N £ N and a probability measure \i on the natural numbers. Let v E Vn '■= 
Z x {1, ...,N} denote an individual of our population. For v £ Vn we write v = (i v ,k v ) with 
i v G Z, and 1 < k v < N, hence i v indicating the generation of the individual in Z, and k v the 
label among the N individuals alive in this generation. 

The ancestral line A(v) = {vo = v, v±, V2, ■ ■ ■} of our individual v is a set of sites in Vjv, where 
i Vo ,i Vl , ■ ■ ■ 4- -co is a strictly decreasing sequence of generations, with independent decrements 
iv t — ivi_ 1 ='■ Vi, I > 1 with distribution /i, and where the k Vo , k vi , . . . are i.i.d. Laplace random 
variables with values in {!,..., N}, independent of {i^jzeNo- Letting 



S n ■= ^Vl, 
1=0 

where we assume So = rjo = 0, we obtain a discrete renewal process with interarrival law ji. 
In the language of [9], we say that a renewal takes place at each of the times S n ,n > 0, and 
we write (q n )nen f° r the renewal sequence, that is, q n is the probability that n is a renewal time. 

It is now straightfoward to give a formal construction of the full ancestral process starting 
from N individuals at time in terms of a family of TV independent renewal processes with 
interarrival law \x and a sequence of independent uniform random variables U r (i),i G — N, r G 
{1, . . . , N}, with values in {1, . . . , N} (independent also of the renewal processes). Indeed, let 
the ancestral processes pick previous generations according to their respective renewal times, and 
then among the generations pick labels according to their respective uniform random variables. 
As soon as at least two ancestral lineages hit a joint ancestor, their renewal processes couple, 
i.e. follow the same realization of one of their driving renewal processes (chosen arbitrarily, 
and discarding those remaining parts of the renewal processes and renewal times which aren't 
needed anymore). In other words, their ancestral lines merge. 

Denote by P N the law of the above ancestral process. For v G Vn with i v = 0, we have 

q n = P N (A(v)n{{-n}x {!,..., N}) /0), (1) 



P»(w g A(v)) = -t 



and the probability that w G Vjv is an ancestor of v, for i w < i v , is given by 

1 

N' 

For notational convenience, let us extend q n to n G Z by setting q n = if n < 0. Note that 

<7o = 1. 

In |6j it was proved that if a has finite support, then the ancestral process, rescaled by 
the population size, converges to a time-changed Kingman-coalescent. Our first result shows 
that this remains true with the same classical scaling for u with infinite support, as long as 
it has finite expectation. We consider the ancestral process of a sample of n < N individuals 
labelled vi,...,v n sampled from generation k = 0. We define the equivalence relation ~/% on the 
set {l,...,n} by 

i~ k j^ A(vi) n A( Vj ) n ({-k, ..., 0} x {1, ..., N}) / 0, 

that is % ~fc j if and only if Vi and Vj have a common ancestors at most k generations back. Let 
A/v,n(&) denote the set of equivalence classes with respect to ~^, which is a stochastic process 
taking values in the partitions of {l,...,n}. Let E := {l,...,n}, and let De[0,oo) denote the 
space of cadlag functions from [0, oo) to E with the Skorohod topology. 

Theorem 2.1. Assume E^[r/i] < oo. Let f3 := E L i ■ As N — > oo, the process (A/v,n(|_~jpJ))t>o 
converges weakly in De[0,oo) to Kingman's n—coalescent. 

Two individuals v, w € Vm have a common ancestor if and only if A(v) n A(w) ^ 0. If this 
is the case, and if v and w belong to the same generation, we denote by r the time to the most 
recent common ancestor, 

r := inf{n > : A(v) D A(w) n ({-n} x {1, ..., iV}) / 0}. 

Clearly, the law of r is the same for all v, w with i v = i w . 



Theorem 2.1 implies that if /x has finite expectation, two randomly sampled individuals have a 
common ancestor with probability 1, and the expected time to this ancestor is of order N. If 
the expectation does not exist, this changes completely. Let us now assume that fi G r a , which 
means that the tails of /j, follow a power law. Our second result distinguishes three regimes: 

Theorem 2.2 (Existence and expectation of the time to the most recent common ancestor). 
Let fi G T a and let v, w G Vn, V ^ w 

(a) Ifae (0, 1/2), then P^{A(v) D A(w) / 0) < 1 for all N G N, 

(b) Ifae (1/2, 1), then P^{A(v) n A(w) / 0) = 1 and E%[t] = oo for all N G N. 

(c) Ifa>l, then P^(A(v) n A(to) / 0) = 1 /or all N G N, and mmy^ ^ll = ^, w '^ 

In other words, for a > 1/2 two individuals almost surely share a common ancestor, but the 
expected time to the most recent common ancestor is finite for a > 1 and infinite if a G (1/2, 1). 
Hence in real-world populations observed over realistic time-scales, for a G (1/2, 1) (or even for 
a G (1,2) where the mean, but not the variance of fi exists), the assumption that a population 
is in equilibrium has to be treated with care. 

Remark 2.3. In the boundary case a = 1, the choice of the slowly varying function L becomes 
relevant. If we choose L = const., then it is easy to see from the proof that £^[t] = oo. The 
case a = 1/2 also depends on L and requires further investigation. 



2.2 Forward in time process 

Having obtained a good idea about the ancestral process, we would now like to study the forward 
picture. For this it is useful to construct the whole bi-infinite genealogy of the whole population 
at once, which can be done as a spanning forest of a suitable vertex set. We consider graphs - 
in fact trees - with vertex-set Vn = Ij N and a set of bonds En which will be a (random) subset 
of Bn := {(v,w) : v,w G Vjv} where the edges are directed. For v G Vn we write as before 
v = (i v ,k v ) with i v G Z, and 1 < k v < N. We consider the set of directed spanning forests of 
Vn, which we can write down as follows: Let 

Tn '■= {G = (Vn,En) '■ En C Bn s.th. Vt> G Vn, 3! it; G Vn,i w < iv, with e = (w,v) G -Eat}. 

This means, we consider trees where each vertex v has exactly one outgoing (to the past) edge, 
which we denote by e v . This unique outgoing edge, or equivalently, the unique ancestor of v 
is determined as follows. Let {r] v } v( zv N be a countable family of independent \i— distributed 
random variables, and let {U v } ve y N denote independent uniform random variables with values 
in { 1 , . . . , N} independent of the r] v . This infinite product measure induces a law on Tn if we 
define 

e v ■= ((iv -r} v ,U v ),v). 

We denote this probability measure by P N . In words, the ancestor of v is found by sampling 
the generation according to \i, and then choosing the individual uniformly. We see that 



1 

N' 



P N (ev = (w,v) £ E N ) = —/j,(i v -i w ). (2) 



Comparing this to our previous construction of the ancestral process, we realise that P N can be 
considered as being the restriction of P N to situations regarding the ancestry of a sample, and 
hence, with slight abuse of notation, we will identify the two measures, dropping the notation 
P N . A tree G G Tn is interpreted as the ancestral tree of the whole bi-infinite population. 



Remark 2.4. Note that for /j, G r a it follows from Theorem 2.2 that G G Tn has only one 
connected component almost surely if a > 1/2, since two individuals belong to the same con- 
nected component if and only if their ancestral lines meet. If a < 1/2, then G has infinitely 
many connected components almost surely, since in that case any two individuals belong to two 
disjoint components with positive probability by Theorem 2.2 (a). 

Having obtained a construction of the genealogy of the population for all times, we can now 
for exampe introduce genetic types. We take the simplest situation of just two types. Let the 
individual v G Vn have type X v G {a, A}, and assume a neutral Wright-Fisher reproduction, 
that is, types are inherited from the parent. This means that in the above construction, indi- 
viduals belonging to the same component of the tree have the same type. In particular, in the 
case a > 1/2, everyone in the population has the same type. This is clear, since constructing 
the whole tree at once means that we are talking about a population in equilibrium, meaning 
that fixation of one of the two types has already occurred. However, in the case a < 1/2 the 
tree has infinitely many components almost surely, and therefore both types can persist for all 
times. We can assign to each component independently type a with probability p G [0, 1] and 
type A otherwise. For each p £ [0, 1] this procedure defines a probability measure on {a, ^4} N . 

Definition 2.5. Let A^ denote the probability measure on {a, A} N which, given G G Tn, 
assigns each connected component of G independently type a with probability p, and type A 
otherwise. 



Remark 2.6. It can be shown, following [4j, that in a certain sense the measures A^ are the 
only relevant probability measures on {a, A} N consistent with the dynamics of our population 
model. We make this precise in the appendix. For now, we just assume that the type distribution 
of our population is given by A^. 

We can now introduce the frequency process: Let v := (i,k) G V/v, that is, i denotes the 
genration of the individual, and k its label among the N individuals of generation i. Let 

1 N 

Y N{i) ■= jy5j 1 {X ilh =a}- 

Our construction allows us to easily compute some correlations for the frequency process of the 
seed bank model. Recall q n from the last section. 

Theorem 2.7. Let X = X P N . 

(a) E x [Y N (i)]=pVi£Z, 

(b) If fi£T a with a > 1/2, cov x (Y N (0),Y N (i)) =p(l-p) Vi G Z,VN G N, 

(c) If /i G r a with a G (0, 1/2), we have limjv-^co cov^(Y/v(0), Yn(i)) = 0, 

C(i) := lim corrx(Y N (Q), Y N (i)) G (0, 1) for all i G Z, 

N— s>oo 

and, as i —> oo, for some constant c and some slowly varying function L, 

rd\ (l-a) 2 -p(l-p) ,-2a-i r /,-N 

CW ~r(2-a) 2 r(2a)(E^o^ + l)' Z W ' 

where ~ means that the ratio of the two sides tends to 1, and the sum occurring in the 
denominator is finite. 

Remark 2.8. If a > 1/2, we have that corr^(Y/v(0), Yjv(?)) = 1. This is clear, since in this 
case all individuals have the same type, and E\{Y^{i)) = p, y&Tx(Y^(i)) = p(l — p) and 
corr A ((y JV (0),lAr(i)) = l. 

3 Proofs 

3.1 Proof of Theorem EO 



The proof of Theorem 2.1 follows ideas of [6j, which we combine with a coupling argument relying 



on renewal theory. In certain steps we have to take particular care of the unboundedness of the 



support of the measure //, these steps are carried out with particular care in Lemmas 3.1 and 



3.2. Recall that for Theorem 2.1 we assumed that the expectation of the renewal process exists, 

i.e. E^[r]i] < oo, which in the case \x G r a holds for a > 1. For the case a = 1, finiteness of the 

expectation depends on the choice of the slowly varying function L. 

We first introduce an 'urn process' similar to the one introduced in [6], for measures /i with 

potentially unbounded support. The point is that our ancestral process An can then be realised 

as a simple function of this urn process. 

Keep N fixed. For 1 < n < N let 



oo 

S n ■= Uxi,X2,...),Xi G No, y^Xj = n\. 



j=l 



For n £ N we construct a discrete-time Markov chain {X n (k)}k£N with values in S n that we 
will refer to as the n-sample process. Let X n (0) = (X? (0), X^(0), ...) be such that \X n (0)\ = n. 
We think of Xf(§) £ {0, .., n} as the number of balls currently placed in urn number i. Later, 
urns will correspond to generations, balls to individuals. The transition from time k to time 
k + 1 is made by relocating the X™(k) balls in the first urn in a way that is consistent with 
the ancestral process of our seed bank model, and shift the other urns including their contained 
balls one step to the left: Let a : M N — > M N : (xi,X2,...) >->■ (x2,X3,...) denote the one-step 
shift operator, and, for I £ N, let R(l) be an 5;— valued random variable which is multinomially 
distributed with infinitely many parameters: 

J2(0~Mult(l;M(l),M2),»-), 

i.e. R(i) is a random vector of infinite length, and Ri(l) counts the number of outcomes that 
take value i in I independent trials distributed according to /U. Define 



X n {k + 1) = a(X n (k)) + R(X?(k)), A; = 0,1,.. 



(3) 



By definition, X n = {X n (k)}k£ is a Markov chain with (countably infinite) state space S n 
(see Figure 1). 
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Figure 1: Transition from X n (k) (top line) to X n (k + 1) (bottom line): All the balls in urn number 1 
are relocated independently according to \i. 



It provides a construction of n independent renewal processes with interarrival law /j,, if 
one keeps track of the balls. For our purpose, it suffices to note that X™(k) gives, for each k, 
the number of renewal processes that have a renewal at after k steps, which is equal in law to 
the number of original individuals in our seed bank model that have an ancestor in generation 
— k. Now recall our ancestral process {Aj\[ tn (k)} from Section 2, which was constructed using 
coalescing renewal processes. In terms of the X n — process it can be described as follows: Think 
for the moment of each of the urns as being subdivided into N sections. We start with n balls 
and run the X n — process. At each relocation step, each ball which is relocated to urn i + 1 
is put with equal probability into one of the A^ sections in urn i + 1. All balls that end up in 
the same section within an urn are merged into a single ball (Figure 2). Since this results in a 
decrease in the total number of balls, say from n to n' < n, after a merger event, we continue 
to run accroding to a Markov process with law C(X n ) with n' balls, and so on. Denote by 
{X N ' n (k)}keN the well-defined process obtained by this procedure. The number of balls present 




Figure 2: The possible types of coalescence events in the X N - n — process: A coalescent event in urn 2 
induced by a ball landing in an occupied place, a coalescent event in urn 4 due to two balls landing in 
the same empty place, and no coalescence in urn i + 1 although it holds several balls. 



at time k in this process is equal in law to the block-counting process of our ancestral process 
started with n sampled individuals: 



X»*{k)\±\A N Jk)\. 



Unlike An, the process X ' n = {X ' n (k)}ken is a Markov chain in discrete time with 
countable state space U" =1 Si. Of course, it is also possible to define an exchangeable partition 
valued process as a function of X N,n , where balls correspond to blocks (we refrain from a formal 
definition, in order to keep the notational effort reasonable). 

An important step is to observe that for each n, the corresponding urn process X n has a 
unique invariant distribution. Indeed, let 



ft 



n{i,i + 1,...} 

EM 



This fraction is well-defined since we assumed En\rj\ < °°- Denote by v n := Mult(n, fa, fa, ...) 
the multinomial distribution with success probabilities fa. We claim that this is the stationary 
distribution for the n— sample process X n . From classical renewal theory, we know that z/ 1 is 
the stationary distribution in the case n = 1 (see [9]). For n independent renewal processes we 
have (cf. [6]): 

Lemma 3.1. If E^[rj\ < °°; then v n is the stationary distribution for X n , and X n is positive 
recurrent for all n £ N. 

Proof. We reduce the proof to the finite case discussed in J6j . For each j £ N we define 

1 



fij({i}) :-- 



TLMii}) 



l{i<j}M«), * G N. 



This defines a probability measure fj,j with support {l,...,j}. Clearly, limj^oo fij(i) = fi(i) for 



all i, and lim^oo E M . [ 



Ey\r}\ by monotone convergence. 



Let y nj = (y nj ' (k))keM be the Markov chain constructed in the same way as X n , but 
with relocation measure \ij instead of \i, that is, Y n,3 {k + 1) = a(Y n,:) (k)) + R J (Y™' J (k)), where 
R j (l) ~ Mult(/;//j(l), ...,fij(j)), and with Y n ^{Q) = X n {0). Define now 



Pi* 



E H [V] 



Clearly, lim^oo fi\ = ft Vi £ N. Let v^ := Mult(n; ft], f3 J 2 , •••) the multinomial distributions 
on S n with success probabilities fi\. By Lemma 1 of [6] we know that v^ is the stationary 
distribution for Y nj . Fix x,y £ S n . By construction, 

F(X n (l) = y | X n (0) = x) =P(R( Xl ) = y- a{x)) = lim P(R j (xi) = y- a{x)) 

(4) 
= lim P(Y n >3(l)=y\Y n 'J(0)=x). 

For x G 5„, let ^ := max{j : Xj / 0}. Note P(X n (l) = y \ X n (0) = x) = for all x such that 
jx > jy + 1- We write P v n for the distribution of {X n {k))k<^n with initial distribution v n . Then, 
for every y £ S n , 

P yn {X n (l) = y)= Y, v n (x)P(X n (l) = y\X n (0) = x) (5) 

X&S n ,jx<jy + l 

= lim V v"(x)P(Y n (l) = y \ Y nj (0) = x) (6) 

xes n ,j x <j y +i 

= lim ^(y) = u n (y). (7) 

So Mult(n; /3i, f3 2 , •••) is a stationary distribution for X™. By irreducibility it is unique, and X n 
is positive recurrent. □ 

Recall the dynamics of the process X N,n = (X N ' n (k))keN from above. We first compute 
the probability of a coalescence given that we are in a fixed configuration. Define the events 

-B/,fc := {exactly I mergers at time k in X ' n } 

and 

B>i t k := {at least I mergers at time k in X ,ra }, 

for 1 < I < n and k £ N. 

Lemma 3.2. Fix N £ N, n < N, and fj, such that E^fr/] < oo. With the notation of the last 
section, 



P(B 1>k+1 | X N > n (k) = (*i,* 2 , ...)) = 1 f; La^CO + f* 1 ), 



M(»r +°(^ ) (8) 



and there exists < c(n) < oo, depending on X >n only via n, such that 

P{B> 2>k+1 \X N > n (k) = (x x ,x 2 ,...)) < ^. 

Proof. We start with computing the probability of a coalescence in a fixed urn i £ N given 
X N ' n (k) = (X?' n (k),X?> n (k),...) and R(X?> n (k)) = (R 1 (X^ n (k)),R 2 (X^ n (k)), ...). The 
probability for having exactly one coalescence occurring in urn i (note that from k to k + 1 we 
shift all urns by 1) is 

±X^(k)R t (X^(k)) + ^( Ri{X \ m ) -Pii), 

where p(i) = p{i, X N,n (k), R(X™' (k))) is the probability that more than one coalescence hap- 
pens in urn %. Here, the first term is the probability that we see at least one coalescence due to 



one of the relocated balls falling into an already occupied section of urn i, and the second term 
is the probability of seeing at least one coalescence due to two relocated balls falling into the 
same section of urn i. Observe that p(i) is 0(N~ 2 ). More precisely, writing 

M % := X^{k)R t {X^ n {k)) + W^'W 



it is easy to see that, because each ball being moved to urn i has a probability of at most -^ to 
merge at all, 



p(i) < 



n 

n 2 ' 



rN,n, 



and therefore, since given X > n {k) and R(X 1 ' n (k)) there are at most n occupied urns, 

,5 



E^W 



< 



i=i 



n 
N 2 ' 



rN,n, 



Further, given X ,n (k) and R(X" ,n (k)), the probability of having at least two mergers at step 
k + 1, which occur in two different urns i and j, is 

Moreover, for fixed X N ' n (k) and R{X X ,n (k)), we have the trivial bound Yl'jLi Mj < 2n 3 . This 
implies 






N 2 



i=1 j-j¥=i 



4n 6 

w 



Thus the probability of seeing exactly one coalescence in step k + 1, given X ,n (k) and R(X 1 ,n (k)) 



is 



E b Mj " p(i) -7V2 5Z M ^- = ^E M * +o(n- 2 ) 

i=l ^ ' «,j'=i i=l 



-AT,n/ 



Computing R(X"' n (k)) given X N ' n (k) using the multinomial distribution, we obtain 
P(B ltk+1 | X N ' n (k) = x) = E ^(Bi.fc+1 1 ^'"W = x, R(x) = r)P(R(x) = r | X N > n (k) = x) 



r£S n 



N 



r£S„ U=l V \ / / 

IE (x i+1 xMi)+ (^(O 2 ) +0(N- 2 ) 



N,n, 



P(R{x)=r\X"' n (k) = x) 



(9) 



where we have used that 

E 0(N- 2 )P(R(X^ n ) = r | X N > n (k) = x) = 0{N- 2 ) 



r£S n 



since the 0(N 2 ) term is bounded uniformly in r £ S n by some ^r, and we average with 
respect to a probability measure. This proves the first claim. We have seen that 

oo 1 OO / \ 

P(B> 2M1 I X N >»{k), R(X^ n (k)) = EpW + ^E M ^ M ^ CV ' * 



i=i 






N 2 ' 
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This proves the second part. 

□ 

We now have the ingredients to prove convergence to Kingman's coalescent. 



Proof of Theorem 2.1 Fix n G N. We will first study the process started in the sta- 
tionary distribution v Then we will extend the result to arbitrary initial distributions using an 
adaptation of Doeblin's coupling method. To prove convergence in the stationary case, we just 
need to prove that the inter-coalescence times for binary mergers are distributed asymptotically 
exponential with rate fi\ Q) , and that multiple coalescences are negligible. Starting from the 
stationary distribution, the probability of seeing a coalescence in the next step given that we 



have currently n balls is obtained as in [BJ, using Lemma 3.2 

P{B ljk+1 | X N > n (k) ~ v n ) = E„« [P{B ljk+1 | X N > n (k))} 



PKAL^Pi 



N \y\ u ^ U ) ' (10) 



N K 2> +0{N 

where we have computed the expectations with respect to the multinomial distribution v n 

and used 2 YZi %^(0 + TZi M*) 2 = 1- 

We have seen before that multiple coalescences happen with negligible probability. Hence if 
we speed up time by a factor N, we obtain for the inter-coalescence times 

?2 /„\ \ Nt 



lim P(no coalescence in X N > n before time Nt) = lim ( 1 - ^ ( ) + 0(N- 2 ) ) = e - ^)*. 



For the coupling argument, we consider now a process X ,n which runs as follows: Start with 
n balls in the stationary distribution u n , and let it evolve according to the n— sample dynamics. 
After each coalescence event, sample a new starting configuration according to v n , where n' 
is the number of balls present after the coalescence, and run the process according to the 
n'— sample dynamics. Assume now that X N,n starts in a given initial distribution. Define 

rW := inf{t > : X N ' n (t) = X N ' n (t)}. 

We couple X N,n and X N,n as follows. Colour the balls of X N,n red and the balls of X N ' n blue. 
Label both the red and the blue balls 1, ..., n. Recall that the dynamics of our urn process just 
consists in moving balls from urn one independently from each other to a new urn according to 
//, and merging balls in the same urn with probability j? per pair. Run the red and the blue 
process independently. Let us first assume that no coalescences occur in either of the processes. 
Now if at some time k, the red ball number i and the blue ball number i happen to be in the 
same urn (but not necessarily in the same section), we couple them and let them move together 
from this time onwards. Denote by a", the time of this coupling. Note that ctj is finite almost 
surely, since it is the coupling time of two renewal processes . Then we continue running our 
processes until all the balls have coupled. Let T coup := max{<7i,l < i < n}. Note that this 
time is independent of N. Since n is fixed, and the different balls move independently, we have 
P (Tcoup < oo) = 1 no matter which initial distributions we choose (see [9], chapter II), and 
hence 

lim P(T coup >t)=0. 

t— >oo 

11 



Speeding up time by N, the coupling happens much faster than the coalescence: Let T^, oal be 
the time of the first coalescence in either the red or the blue process. At each time step, the 
probability of having a coalescence in the next step is bounded from above by the crude uniform 
estimate n 2 /N. Hence 



/ 2\ ViV 

lim ^(T c 2 > v 7 ^) > lim 1-^ =1 

N— >oo N— >oo \ 1 V / 



Since 
we get 
This implies 



lim P (T coup < VN) 



< N ) ^ HP \ ^ I- vlrAN) 



lim P{ T cli > Tcoup) > lim P(T^J > VN,T coup < VN) 



from which we see 



lim P{TW + Tcoup ) = lim P{T^ < T coup ) = 0, 



lim P(T (JV) > Nt) = lim P(T coup > Nt) = 0. 

Af^-oo v ' Af-s>oo v ' 



Hence we can restart our process X ,n after each coalescence event, and the two processes will 
couple with probability 1 before the next coalescence takes place, and indeed on the coalescent 



time scale (time sped up by N) the coupling happens instantaneously. Using (11) we thus 
obtain for the inter-coalescence times of the process started in an arbitrary but fixed initial 
configuration 

lim P(no coalescence in X N ' n before Nt) =e~^^) t . (12) 

N— >oo 

This implies as before by standard arguments that \X ,n (Nt)\ converges weakly as N — > oo to 

the block-counting process of Kingman's coalescent. Since \X ,n (Nt)\ = \Aj^ jn (Nt)\, and the 
fact we obviously have exchangeability of the ball configurations, we even obtain the convergence 
to Kingman's n— coalescent in the obvious sense. □ 

Remark 3.3. It appears remarkable that E^[r/] < oo is sufficient for this result. If E^^ 2 ] = oo, 
and Y denotes the label of the urn that a ball is placed in, then E v n[y] = oo and by [Sj, 
E[5] = oo. However, due to the time rescaling, the fact that P(S < oo) = 1 is enough for our 
purpose. 

Remark 3.4. In order to show convergence to Kingman's coalescent, we could also follow the 
approach of [6|, which uses Mohle's Lemma |1U] to show convergence of finite dimensional 
distributions. Note however that in our case the state space of the Markov chain is infinite, 
hence the transition matrices are infinite. Indeed, denoting the transition matrix of X N > n by 
n^v = {IiN{x,y)}x,yeu a ± 1 s j : w e can decompose n.jv as n^ = A + ^B + 0(N~ 2 ), where A is 
given by the transitions of the X n — processes without coalescence, and B contains adjustments 
that need to be made to the X n — process in case of a single coalescence event (compare [6]). The 



higer order coalescences are 0(N 2 ) by Lemma 3.2 To apply Mohle's Lemma it is sufficient 
to show that P := lim m _>oo A m and G := PBP exist. We first take care of the part without 
coalescence. Let A be defined by A(x,y) := YTj=i l {x, y as n } A n{x,y), where (A n (x,y))( x ^ eS ™ 

yields lim^oo A*(x,y) = v n (y) for all 



3.1 



denotes the transition matrix of X n . Then Lemma 

x,y G S n . Therefore we obtain linim^oo A m = P, where P = (P(x,y)) Xtye s with P(x,y) = 

Y^j=\ ^{x,yes } y:> (y)- We can now define B as the matrix of the single coalescence events as in 
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[6j. That is, if x £ Si, y 6 Si— 1, then B(x,y) is the probability that the balls from configuration 
x are relocated according to the matrix Ai, and that exactly one pair of them coalesces, so that 
we end up with configuration y. If x € Sj, then B(x,y) = if y ^ <% U Sj_i. If x and y are 
in Si, then B(x,y) gives the correction for the X 71 — process in case of a coalescence, therefore 
B{x,y) > —A{x,y) in this case. Hence B has the same block form as in [6J, however, the single 
blocks are of infinite size. Furthermore, ||£?|| = max ieU "_ Si^2 y \B(x,y)\ < 2. Since P is a 
projection, G = PBP is a bounded operator, and therefore e ,t S R, exists as a convergent 
series. Now the computations work in exactly as in the case of bounded support, hence we 
obtain the convergence to Kingman's coalescent following the proof of [6j. □ 

Remark 3.5. Note that Mohle's result allows the following heuristic interpretation of our limiting 
process X ,n as N — > oo. First, the process, for each number of 'active' balls n' < n, mixes 
rapidly and essentially instantaneously enters its stationary distribution on the configuration 
with n' balls. Note that as long as there is no coalescence event, any future evolution does 
not affect the block counting process A^^ n , and also not the corresponding partition- valued 
process, where each 'active' ball denotes a block in a partition of {!,... n} consisting of all 
labels of balls that have merged into this active ball. Now, in each 'infinitesimal time step', our 
limiting process picks an entirely new state from its stationary distribution, independent of its 
'previous' state (this is the effect of the projection operator P). In a way it can be regarded as 
a 'white noise' process on the space of stationary samples. While this process obviously has no 
cadlag modification, both the block counting process, and the partition valued process, remain 
constant until there is a new merger, and are thus well-defined (recalling that such mergers, 
that is, transitions from n' active balls to n' — 1 active balls, happen at finite positive rate in 
the limit). 

3.2 Proof of Theorem [2721 

Recall from section 2 that the time to the most recent common ancestor is related to the coupling 
time of two versions of the renewal process. Recall 



q n = P&(A(v) n ({-n} x {1, ..., JV}) # 

We will need some bounds on the q n that can be obtained via Tauberian theorems. 
Lemma 3.6. Let fj, G T a . 

(a) Let a S (0, 1). Then 



n=0 



1 — a 
T(2-a)r(l + a) 



i a L(i) as i — > oo, 



(b) The sum 



n=0 



2 



is finite if a £ (0, 1/2) and infinite if a > 1/2. 
(c) Let a e (0,1/2). Then 

(! " «) 2 „. 2 «-i 



J2 qnqn - 



r(2-a) 2 r(2a) 

n=0 
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% a L(i) as i —7- oo. 



Proof. The proof of this lemma can be found in [4J, Lemma 5.1. □ 



Proof of Theorem 2.2 We first prove (c), which corresponds to the case where we have 
convergence to Kingman's coalescent. Without loss of generality, assume i v = i w = 0. Denote 
by (R n ) and (R' n ) the sequences of renewal times of the renewal processes corresponding to v 
and w respectively, that is, R n = l{ n e{So,Si,— }}" ^ n °ther words, R n = 1 if and only if v has an 
ancestor in generation — n, and q n = P(R n = 1)- Let 

T := inf{n : R n = R' n = 1} 

denote the coupling time of the two renewal processes. Since each time v and w have an ancestor 
in the same generation, these ancestors are the same with probability N, we get 

E[t] = NE[T]. 

But if a > 1, we have that E^[r]i\ < oo, and therefore by Proposition 2 of [8], E[T] < oo. The 
result now follows from Theorem |2.1| and the fact that the expecte time to the most recent 
common ancestor of n individuals in Kingman's coalescent with time change f5 2 is given by 

n 



E[TMRCA] = Wh~W) = J 2 V~n) 



k=2 \2) 



hence for n = 2 we get -^ ■ 

(b) For independent samples R and R' , the expected number of generations where both 
individuals have an ancestor, is given by 



oo 

In i 
n=0 n=0 n=0 



E[J2R n R' n ] =Y,E[R n ]E[R / n ] = J^ql 



which is infinite if a > 1/2 due to Lemma 3.6 (b). Each of these times, the ancestors are 



the same with probability 1/iV, therefore with probability one A(v) and A(w) eventually meet. 
However, the expected time until this event is bounded from below by the expectation of the 
step size, 

E^[t]>E[ V ] = oo 

if a < 1. 

(a) In this case, E [ Yl^Lo RnR' n ] = Yl^=o Qn < °°; an d therefore 



P{Y j R n R! n = oo) =0, 



n=0 

which implies that the probability that A{v ) and A(w) never meet is positive. 
3.3 Proof of Theorem l2?rl 



We prove now Theorem 2.7 We define Y v := l{x„= a }- 



Lemma 3.7. Let A = A^, and assume fi G T a . 
(a) If a > 1/2, 



cov\(Y v , Y w ) = p(l - p), 
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(b) If a 6 (0,1/2), v^w, 

cov\(Y v ,Y w ) =p(l -p) 



Eoo 
n=0 Qn(ln+i v -i w 



Proof. We have 

E^(Y V Y W ) = \(X V = X w = a)= P P£{A(v) n A(w)) / 0) +p 2 (l - P&(A(v) D A(w) / 0)) 
and E% (Y V )E* (Y w ) = p 2 . This implies 

cov x (Y v ,Y w )=p(l-p)P^(A(v)nA(w) / 0). 

If a > 1/2, then Pfc(A(v) n ^4(u>) / 0) = 1 which proves (a). Hence we need to compute 
Pft(A(v) n A(w) 7^ 0) for a < 1/2. To do this, let S n , S' n denote two independent samples of the 
renewal process, with Sq = i v , S' = i w . Note that this implies for the times of the renewals that 



P(Rn = 1) 



ln+i v • 



Recall that the renewal process is running forward in time, whence the ancestral lines are traced 
backwards. Let A v and A w denote two independent samples of the ancestral lines of v and w, 
using the processes S and S' respectively, without coupling the processes. Then the expected 
number of intersections of A v and A w is given by 



1 oo 1 oo 

E[\A v nA w \] = -E[ Y, R n R' n ]=- Y, q n+ i v q n +i w 

oo ^ l6 > 

= T7 / j Q.nQ.n+i v -i w i 
n=0 

On the other hand, conditioning on the event that the ancestral lines meet (which clearly 
has positive probability) , and then restart the renewal processes in the generation of the first 
common ancestor, which is the same as sampling two ancestral lines starting at (0, 0), 

E[\A V n A w \] =E[\A V n A w \ | A v n A w ^ %]P{A V n A w / 0) 

=p(A(v) n A(w) + m[\\ m n A m \] 



--P{A{v)nA(w)?Q)U + ±f^A 



Recalling qo = 1 this implies 

P£(A(v)nA(w)f 



Eoo 
,, ,. ,. , _ ,., , _ n=0 Qnqn+i v - 



which proves the Lemma. □ 



Proof of Theorem 2.7[ (a) is obvious and (b) follows from Lemma 3.7 For (c), let a £ (0, 1/2). 



Lemma 



3.6 



tells us that Yln^o In < °°- From Lemma 3.7 it follows that for i ^ 



Eoo 
n=0 QnQn- 

'N + ^Zill 



cav\(Y N (0),Y N (i)) = p(l - p) v „ x L> 
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For the variance we obtain 



var A ( 



1 N 
(y JV (z))=— Y^ COV A (%*;), F(i J)) 



N 2 

k,j=i 

:^ ATp(l - p) + N(N - l)p(l - p) ^=°J n 2 



=p(l - p) 



E^oC + i-W 



iV + E~ i 4 



Hence 



con x (Y N (0),Y N (i)) 



Eoo 
n=0 WW 



which converges as iV — > oo. The result now follows from Lemma |3.6| (c). 



□ 



Appendix: Gibbs measure characterization of the forward 
process 



In section 2.2 we claimed that the measures A^ are in a certain sense the only measures de- 



scribing the type distribution which are consistent with the dynamics of our process. In order 
to make this rigorous, we use a Gibbs measure characterization, which relies on the approach 
of 0]. In order to construct the Gibbs measure, we start with prescribing the distribution of 
types conditional on the (infinite) past. Let Sn := {a, A} N denote the finite dimensional state 
space. Let X v = Xr ivjk \ G {a, ^4} denote the type of individual v that is the fcth individual 
of generation i. We denote by C the sigma-algebra of cylinder events, and write a n for the 
a— algebra generated by cylinder sets contained in {...., n}. For i £ Z, we define the probability 
kernel Ajv,i(-|-) from {Sf;,ai) to (Sfj,C) by saying that for any finite set B C {i + 1, ...} , and 



xb £ {a,A} B , and for £ G S 



-i,i} 



*N,iWB 



the conditional probability 
xb) ■= \N,i({ x \s = xb} I 



is obtained by first sampling G G T/v > tracing back the ancestral line of every v G B until it first 
hits {...,i}, and then assigning the type £. of this ancestor to v. This is well defined because 
under P^ the tree until it first hits {...,i} is independent of cij. These kernels X Ni ,i G Z are 
now used to construct the Gibbs measures. Due to the construction via product measures it is 
clear that they are consistent: If i < j, then for B C { j + 1, ...} x {1, ..., N}, 



\% ti (X v = x v ,v G B | X w = £ 2 , i + 1 < i w < j) = \%^ (X v = x v , v G 



B) 



Here, £ V £ 2 denotes the configuration which is equal to £ x on {...,i} and equal to £ 2 on 
{i + 1, ..., j}. So we can now define the Gibbs measures for our model: 

Definition A.l. A probability measure Atv on Sfj is called a n~ Gibbs measure if for all 

i G Z, for all finite subsets B C {i + 1, ..., } X {1, ..., N}, and for all xb G {a, A} B the mapping 
£ i— >• X N i{xb) is a version of the conditional probability 



\n{X\ 



B 



XB 



Pi 
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In other words, to sample from the Gibbs measure conditional on the past up to generation 
i, we first sample a G G 7jv according to Pj^, and assigning each X v ,i v > i + 1,1 < k v < N, 
its type according to the ancestors. It is clear that such measures exist, in fact, X P N defined 



in section 2.2 clearly is a fi— Gibbs measure for p G {0, 1}, and if G G Tn has infinitely many 
components almost surely, then for allp G [0, 1], the measures A^ are /i— Gibbs measures. Recall 
that this is the case if \x G T a with a < 1/2. This is the situation where the Gibbs measure 
characterization is interesting. 

A particularly useful feature of our model is that the only relevant Gibbs measures are of the 
form A^-. Note that the \x— Gibbs measures form a convex set, as can be seen easily, and we can 
characterise the extremal points of this set generalizing Proposition 1 of [3]. 

Proposition A. 2. Assume fi G T a . 

(a) Let a G (0,1/2). For each fixed N, for each p G [0,1], there is precisely one extremal 
fi— Gibbs measure Ajy on Sfj such that \^{Xi^ = a) = p for all i G Z, 1 < k < N. 

(b) Let a G (1/2, oo]. The only extremal Gibbs measures are X a N and X N . For p G (0,1), the 
measures X P N are given by X P N = pX^ + (1 — p)X] v . 



The proof of Proposition A. 2 follows closely the Proposition 1 of [3J, and we refer the reader 



to this work for details. Note that part (b) follows immediately from Theorem 2.2, as this 
implies that all individuals have the same type almost surely. The crucial step in the proof of 
part (a) of the proposition is the following Lemma. 

Lemma A. 3. Let X be a extremal fj, — Gibbs measure. Then there exist p G [0, 1] such that for 
all v = {i v , k v ) G Vn 

lim X(X V = a\a- m ) = p X — a.s. 

771— »0O 

Proof. For fixed v, the existence of the limit follows from the backward martingale conver- 
gence theorem, see [5], page 233, and the fact that it is constant follows from the tail triviality 
of extremal Gibbs measures. It remains to prove that it is independent of v. For this we couple 
the ancestral lines of two individuals v\ and V2 as in [3] in as far as their i— coordinate (the 
generations) is concerned, and concernig the £;— coordinate, that is, the label of the individual 
among the N individuals per generation, we simply couple them completely, which does not 
change the law of the process. Hence the proof of [3j goes through with only minor changes. □ 



For the rest of the proof of Proposition A. 2 see [3]. The main idea is as follows: For any 



finite set of individuals, there exists a (random) time T before which the ancestral lines don't 



meet. This time is finite a.s., and in view of Lemma A. 3 there exists p G [0, 1] such that the 
ancestors alive just after time T get their types independently with probability between p — e 
and p + E. This then implies that A = X P N , which, as we recall, conditional on G G T/v is induced 
by the product Bernoulli measure on the components of G with success parameter p. 



Acknowledgments The authors wish to thank an anonymous referee for making valuable 
suggestions which improved the presentation of the results considerably. 

References 

[1] N.H. Bingham, C.M. Goldie, J.L. Teugels, Regular variation, Cambridge Universtiy Press, 
1987. 

17 



[2] R. J. Cano, M. K. Borucki, Revival and identification of bacterial spores in 25- to 40- 
million-year-old Dominican amber, In: Science. Bd. 268, Nr. 5213 (1995), 1060-1064. 

[3] S. Ethier, T. Kurtz, Markov processes: characterization and convergence, Wiley, 1986 

[4] A. Hammond, S. Sheffield, Power law Polya's urn and fractional Brownian Motion, 
|arXiv:0903.l284> 3. To appear in Prob. Theory Rel. Fields. 

[5] J. Jacod, P. Protter, Probability essentials, Second Edition, Springer 2003. 

[6] I. Kaj, S. Krone, M. Lascoux, Coalescent theory for seed bank models, J. Appl. Prob. 38, 
285-300 (2001). 

[7] D. A. Levin, The seed bank as a source of genetic novelty in plants, American Naturalist 
135, 563-572 (1990). 

[8] T. Lindvall, On Coupling of Discrete Renewal Processes, Z. Wahrsch. verw. Gebiete 48, 
57-70 (1979). 

[9] T. Lindvall, Lectures on the coupling method, Wiley 1992. 

[10] M. Mohle, A convergence theorem for Markov chains arising in population genetics and the 
coalescent with selfing, Adv. Appl. Prob. 30, 493-512 (1998). 

[11] A. Tellier, S. J. Y. Laurent, H. Lainer, P. Pavlidis, W. Stephan, Inference of seed bank 
parameters in two wild tomato species using ecological and genetic data, PNAS, Vol. 108 
No. 41. 17052-17057 (2011). 

[12] R. Vitalis, S. Glemin, I. Oliviere, When genes got to sleep: The population Genetic Conse- 
quences of Seed Dormacy and Monocarpic Perenniality, The American Naturalist 163, no. 
2 (2004). 

[13] S.Yashina, S. Gubin, S. Maksimovich,A. Yashina, E. Gakhova, D. Gilichinsky, Regeneration 
of whole fertile plants from 30,000-y-old fruit tissue buried in Siberian permafrost, PNAS, 
Vol. 109 No. 10, 4008-4013 (2012). 



18 



